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DETAILED ACTION 

1 . A request for continued examination under 37 CFR 1.114, including the fee set forth in 
37 CFR 1 .17(e), was filed in this application after final rejection. Since this application is 
eligible for continued examination under 37 CFR 1.1 14, and the fee set forth in 37 CFR 1.17(e) 
has been timely paid, the finality of the previous Office action has been withdrawn pursuant to 
37 CFR 1.114. Applicant's submission filed on August 4, 2005 has been entered. Claims 1, 3, 
6-8, 10-15, 18 and 20-26 are now pending. 

Response to Arguments 

2. Applicant's arguments have been fully considered but they are not persuasive. 

Applicant again contends that neither Poulsen nor Sundaresan teaches or suggests 
translating a program unit into two different program units (Applicant's remarks, page 6, third 
paragraph). 

However, Poulsen does indeed teach translating the program 100 (i.e., "a first program 
unit") into a translated parallel computer program 130 (see, for example, column 8, lines 32-35) 
that includes two different program units. Specifically, Poulsen discloses translating global 
storage objects into privatizable storage object declarations (i.e., "a third program unit") that 
encapsulate the global storage objects (see, for example, column 8, lines 59-61), and translating 
parallel regions into library calls (i.e., "a second program unit") that reference the privatizable 
storage object declarations (see, for example, column 9, lines 2-12). 

Applicant apparently suggests that a "program unit" is necessarily a complete and 
separate computer program such that the "single translated computer program 130" of Poulsen 
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cannot include two different "program units" (Applicant's remarks, page 6, third paragraph). 
However, a "program unit" is reasonably interpreted as some "unit" of a program, and the plain 
language of the claims does not suggest otherwise. Limitations from the specification are not 
read into the claims. See In re Van Geum, 988 F.2d 1 181, 26 USPQ2d 1057 (Fed. Cir. 1993). 
Moreover, Applicant's specification indicates that a "program unit" is merely "a collection of 
statements in a programming language" (specification, page 7, lines 13-15). In Poulsen, the 
library calls in the translated program are one collection of statements, and the declarations for 
the privatizable storage objects are another collection of statements. Thus, Poulsen teaches 
translating a first program unit into second and third program units. 

Applicant further contends that nowhere do the references teach or suggest translation of 
the first program unit into a second program unit that references a third program unit, where the 
third program unit encapsulates a reduction operation, and that nowhere does Poulsen teach or 
suggest encapsulation of a reduction operation (Applicant's remarks, page 6, last paragraph). 

However, one cannot show nonobviousness by attacking references individually where 
the rejections are based on combinations of references. See In re Keller, 642 F.2d 413, 208 
USPQ 871 (CCPA 1981), and In re Merck & Co., 800 F.2d 1091, 231 USPQ 375 (Fed. Cir. 
1986). As noted above, Poulsen teaches translating the program (i.e., "the first program unit") 
into library calls (i.e., "a second program unit") that reference privatizable storage object 
declarations (i.e., "a third program unit"). In other words, Poulsen teaches translating the first 
program unit into a second program unit that references a third program unit. Poulsen discloses 
that the third program unit encapsulates a global storage object (see, for example, 8, lines 59-61), 
but does not expressly disclose that it encapsulates a reduction operation. Nonetheless, 
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Sundaresan teaches a reusable reduction object that does encapsulate a reduction operation (see, 
for example, column 5, lines 7-14). Thus, Poulsen in view of Sundaresan teaches that the third 
program unit encapsulates a reduction operation. 

Claim Rejections - 35 {JSC §103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

4. Claims 1, 3, 6-8, 10-15, 18, 20, 21, 23, and 26 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over U.S. Patent No. 5,812,852 to Poulsen et al. (art of record, "Poulsen") in 
view of U.S. Patent No. 5,937,194 to Sundaresan (art of record, "Sundaresan"). 

With respect to claim 1 (currently amended), Poulsen discloses a method comprising: 

(a) receiving a first program unit in a parallel computing environment (see, for example, 
column 8, lines 29-30, which shows receiving a first parallel computer program unit). 

Although Poulsen discloses that the first program unit includes parallel regions and 
global storage objects (see, for example, column 8, lines 29-30), Poulsen does not expressly 
disclose the limitation wherein the first program unit includes a reduction operation associated 
with a set of variables. 

However, Sundaresan discloses a reduction operation associated with a set of values or 
variables, wherein the reduction operation performs an algebraic operation on the values or 
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variables and is partitioned among a plurality of threads (see, for example, column 7, lines 13-16, 
and column 1, lines 59-63). The reduction operation in Sundaresan is encapsulated in reusable 
reduction objects so as to improve the expressibility and maintainability of parallel code (see, for 
example, column 5, lines 7-14, 21-23 and 30-33). 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the system of Poulsen to include a reduction operation, such as with the 
reusable reduction objects taught by Sundaresan, for the purpose of improving the expressibility 
and maintainability of the parallel computer program. 

Therefore, Poulsen in view of Sundaresan discloses receiving a first program unit in a 
parallel computing environment, the first program unit including a reduction operation associated 
with a set of variables. 

Poulsen in view of Sundaresan further discloses: 

(b) translating the first program unit into a second program unit, the second program unit 
including a set of one or more instructions to partition the reduction operation between a 
plurality of threads including at least two threads and to reference a third program unit (see, for 
example, Poulsen, column 8, lines 32-35, which shows translating the program, and column 9, 
lines 2-12, which further shows translating parallel regions into library calls or a second program 
unit that references privatizable storage object declarations or a third program unit, and see, for 
example, Sundaresan, column 7, lines 13-16, which shows that the reduction operation is 
partitioned among a plurality of threads, as presented above); and 

(c) translating the first program unit into the third program unit, the third program unit 
including a set of one or more instructions that encapsulate the reduction operation to perform an 
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algebraic operation on the variables (see, for example, Poulsen, column 8, lines 32-35, which 
shows translating the program, and column 8, lines 59-61, which further shows translating global 
storage objects into privatizable storage object declarations or a third program unit that 
encapsulates the global storage objects, and see, for example, Sundaresan, column 7, lines 13-16, 
and column 1, lines 59-63, which shows that the reduction operation performs an algebraic 
operation on the values or variables, as presented above, and column 5, lines 7-14, which further 
shows that the reduction operation is encapsulated in a reduction object). 

With respect to claim 3 (currently amended), Poulsen in view of Sundaresan further 
discloses reducing the set of variables logarithmically (see, for example, Sundaresan, column 7, 
lines 16-18, which shows that the reduction operation reduces the values or variables 
logarithmically). 

With respect to claim 6 (original), Poulsen in view Sundaresan further discloses 
associating the plurality of threads each with a unique portion of the set of variables (see, for 
example, Sundaresan, column 7, lines 20-21, which shows that the reduction operation associates 
individual values or variables to each of the threads). 

With respect to claim 7 (original), Poulsen in view of Sundaresan further discloses 
combining, in part, the variables associated with the plurality of threads in a pair-wise reduction 
operation (see, for example, Sundaresan, column 1 1, line 48 to column 12, line 7, which shows a 
sample reduction operation that combines the values or variables associated with the plurality of 
threads in a pair-wise reduction operation, wherein a given thread has a fan-in of two threads, 
which is to say a pair of threads). 
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With respect to claim 8 (currently amended), Poulsen discloses an apparatus comprising: 

(a) a memory including a shared memory location (see, for example, column 8, lines 37- 
39, which shows a memory, and column 7, lines 7-10, which shows a global storage object in a 
shared memory location); 

(b) a translation unit coupled with the memory (see, for example, column 8, lines 32-35, 
which shows a translation means). 

Although Poulsen discloses a first parallel computer program unit (see, for example, 
column 8, lines 29-30) and further discloses translating the first program unit (see, for example, 
column 8, lines 32-35), Poulsen does not expressly disclose the limitation wherein the translation 
unit is to translate a first program unit including a reduction operation associated with a set of at 
least two variables into a second program unit, the second program unit to partition the reduction 
operation between a plurality of threads including at least two threads and to reference a third 
program unit, and wherein the translation unit is to also translate the first program unit into the 
third program unit, the third program unit to encapsulate the reduction operation to perform an 
algebraic operation on the variables. 

However, Sundaresan discloses a reduction operation associated with a set of values or 
variables, wherein the reduction operation performs an algebraic operation on the values or 
variables and is partitioned among a plurality of threads (see, for example, column 7, lines 13-16, 
and column 1, lines 59-63). The reduction operation in Sundaresan is encapsulated in reusable 
reduction objects so as to improve the expressibility and maintainability of parallel code (see, for 
example, column 5, lines 7-14, 21-23 and 30-33). 
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It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the system of Poulsen to include a reduction operation, such as with the 
reusable reduction objects taught by Sundaresan, for the purpose of improving the expressibility 
and maintainability of the parallel computer program. 

Therefore, Poulsen in view of Sundaresan discloses a translation unit coupled with the 
memory, the translation unit to translate a first program unit including a reduction operation 
associated with a set of at least two variables into a second program unit, the second program 
unit to associate the reduction operation with one or more instructions operative to partition the 
reduction operation between a plurality of threads including at least two threads, the translation 
unit to also translate the first program unit into a third program unit, the third program unit to 
associate the reduction operation with a set of one or more instructions operative to perform an 
algebraic operation on the variables (see the rejection of claim 1 above). 

Poulsen in view of Sundaresan further discloses: 

(c) a compiler unit coupled with the translation unit and the memory, the compiler unit to 
compile the second program unit and the third program unit (see, for example, Poulsen, column 
8, lines 42-45, which shows an executable program, which is to say a compiled program, and 
column 13, lines 11-13, which shows that the translation may be integrated with a compiler); and 

(d) a linker unit coupled with the compiler unit and the memory, the linker unit to link the 
compiled second program unit and the compiled third program unit with a library (see, for 
example, Poulsen, column 8, lines 39-42, which shows a linker for linking the program with a 
library). 
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With respect to claim 10 (currently amended), Poulsen in view of Sundaresan further 
discloses the limitation wherein the variables in the set of variables are each uniquely associated 
with the plurality of threads and the library includes instructions to combine, in part, the 
variables associated with the plurality of threads (see, for example, Poulsen, column 10, lines 9- 
1 1 and 15-19, which shows that instructions in the library are called for each parallel region in 
the program, and Sundaresan, column 7, lines 20-21, which shows that the reduction operation 
associates individual values or variables to each of the threads). 

With respect to claim 1 1 (currently amended), Poulsen in view of Sundaresan further 
discloses the limitation wherein the library includes instructions to combine, in part, the variables 
in a pair- wise reduction (see, for example, Sundaresan, column 11, line 48 to column 12, line 7, 
which shows a sample reduction operation that combines the values or variables associated with 
the plurality of threads in a pair-wise reduction operation, wherein a given thread has a fan-in of 
two threads, which is to say a pair of threads). 

With respect to claim 12 (original), Poulsen in view of Sundaresan further discloses a set 
of one or more processors to host the plurality of threads, the plurality of threads to execute 
instructions associated with the second program unit (see, for example, Poulsen, column 6, lines 
46-50, which shows one or more processors for executing the plurality of threads). 

With respect to claim 13 (currently amended), Poulsen in view of Sundaresan further 
discloses the limitation wherein the third program unit includes a callback routine and the 
callback routine is associated with instructions operative to perform the algebraic operation on at 
least two variables in the set of variables (see, for example, Poulsen, column 9, line 63 to column 
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10, line 9, which shows callback routines for the parallel regions in the program, and 
Sundaresan, column 7, lines 13-16, and column 1, lines 59-63, which shows that the reduction 
operation performs an algebraic operation on the values or variables). 

With respect to claim 14 (original), Poulsen in view of Sundaresan further discloses the 
apparatus of claim 13 wherein the library is operative to call the callback routine to perform, in 
part, a reduction on at least two variables in the set of variables (see, for example, Poulsen, 
column 10, lines 9-1 1 and 15-19, which shows that the routines in the library are called for each 
parallel region in the program, and Sundaresan, column 7, lines 13-16, which shows that the 
reduction operation performs a reduction on the values or variables). 

With respect to claim 15 (currently amended), the limitations recited in the claim are 
analogous to the limitations recited in claim 1 (see the rejection of claim 1 above). Poulsen in 
view of Sundaresan further discloses a machine-readable medium that provides instructions, that 
when executed by a set of one or more processors, enable the set of processors to perform the 
recited method (see, for example, Poulsen, column 8, lines 37-39, and column 6, lines 46-50). 

With respect to claim 18 (currently amended), see the rejection of claim 3 above. 

With respect to claim 20 (currently amended), see the rejection of claim 7 above. 

With respect to claim 21 (new), Poulsen in view of Sundaresan further discloses 
performing a plurality of reduction operations in the third program unit (see, for example, 
Sundaresan, column 12, line 64 to column 13, line 2, which shows performing a plurality of 
reduction operations). 
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With respect to claim 23 (new), Poulsen in view of Sundaresan further discloses a run- 
time library to implement the reduction operation (see, for example, Poulsen, column 9, line 63 
to column 10, line 9, which shows a run-time library to implement the parallel operations). 

With respect to claim 26 (new), Poulsen in view of Sundaresan further discloses the 
limitation wherein the third program unit is to perform the algebraic operation using the library 
(see, for example, Poulsen, column 9, line 63 to column 10, line 9, which shows that the parallel 
operations are performed using the library). 

5. Claims 22, 24 and 25 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Poulsen in view of Sundaresan, as applied to claims 1 and 8 above, respectively, and further in 
view of U.S. Patent No. 6,212,617 to Hardwick (now made of record, "Hardwick"). 

With respect to claim 22 (new), Poulsen in view of Sundaresan does not expressly 
disclose performing a vector reduction operation in the third program unit via a N-dimension 
loop in the third program unit. 

However, Hardwick discloses reduction operations that are applied to vectors formed 
from the basic data types (see, for example, column 6, lines 27-39). The vector reduction 
operations are performed via loops in such a manner as to ensure portability across different 
parallel architectures (see, for example, column 6, lines 40-49). 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the system of Poulsen and Sundaresan to perform reduction operations on 
vectors via loops in the third program unit, such as taught by Hardwick. One of ordinary skill in 
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the art would have been motivated to extend the type-specific reduction operations of 
Sundaresan (see, for example, column 12, lines 10-15) to further include vectors formed from the 
basic data types. 

With respect to claim 24 (new), although Sundaresan discloses performing a plurality of 
reduction operations (see, for example, column 12, line 64 to column 13, line 2), Poulsen in view 
of Sundaresan does not expressly disclose the limitation wherein the third program unit is to 
perform a plurality of vector operations. 

However, Hardwick discloses reduction operations that are applied to vectors formed 
from the basic data types (see, for example, column 6, lines 27-39). The vector reduction 
operations are performed in such a manner as to ensure portability across different parallel 
architectures (see, for example, column 6, lines 40-49). 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the system of Poulsen and Sundaresan to perform reduction operations on 
vectors in the third program unit, such as taught by Hardwick. One of ordinary skill in the art 
would have been motivated to extend the type-specific reduction operations of Sundaresan (see, 
for example, column 12, lines 10-15) to further include vectors formed from the basic data types. 

With respect to claim 25 (new), Poulsen in view of Sundaresan does not expressly 
disclose the limitation wherein the third program unit is to perform a vector reduction operation 
via a N-dimension loop. 

However, Hardwick discloses reduction operations that are applied to vectors formed 
from the basic data types (see, for example, column 6, lines 27-39). The vector reduction 
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operations are performed via loops in such a manner as to ensure portability across different 
parallel architectures (see, for example, column 6, lines 40-49). 

It would have been obvious to one of ordinary skill in the art at the time the invention 
was made to modify the system of Poulsen and Sundaresan to perform reduction operations on 
vectors via loops in the third program unit, such as taught by Hardwick. One of ordinary skill in 
the art would have been motivated to extend the type-specific reduction operations of 
Sundaresan (see, for example, column 12, lines 10-15) to further include vectors formed from the 
basic data types. 
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